-
-
Notifications
You must be signed in to change notification settings - Fork 11.5k
[Rocm] [quantization] Fix quark ptpc moe and add test case #24649
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Rocm] [quantization] Fix quark ptpc moe and add test case #24649
Conversation
Co-authored-by: Haoyang Li <[email protected]> Signed-off-by: Haoyang Li <[email protected]>
|
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run You ask your reviewers to trigger select CI tests on top of Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add If you have any questions, please reach out to us on Slack at https://slack.vllm.ai. 🚀 |
|
hi, @mgoin ,@robertgshaw2-redhat |
|
hi, good morning. |
…ect#24649) Signed-off-by: Haoyang Li <[email protected]> Co-authored-by: Haoyang Li <[email protected]>
…ect#24649) Signed-off-by: Haoyang Li <[email protected]> Co-authored-by: Haoyang Li <[email protected]> Signed-off-by: charlifu <[email protected]>
…ect#24649) Signed-off-by: Haoyang Li <[email protected]> Co-authored-by: Haoyang Li <[email protected]>
…ect#24649) from upstream [Rocm] [quantization] Fix quark ptpc moe and add test case (vllm-project#24649)
…ect#24649) Signed-off-by: Haoyang Li <[email protected]> Co-authored-by: Haoyang Li <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
…ect#24649) Signed-off-by: Haoyang Li <[email protected]> Co-authored-by: Haoyang Li <[email protected]>
…ect#24649) Signed-off-by: Haoyang Li <[email protected]> Co-authored-by: Haoyang Li <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
1.Add support for loading quark's ptpc-format moe models
2.add test case for quark ptpc